CLAIMS 

What is claimed is: 

1 . A method comprising: 

determining whether any unprocessed data bits for a partial variable length symbol 
exist in a first data block; and 

performing a shift merge operation to merge said unprocessed data bits from said 
first data block with a second data block, wherein a merged data block is formed. 

2. The method of claim 1 further comprising receiving said first data block and said 
second data block from a bitstream. 

3. The method of claim 1 wherein said first data block and said second data block 
are loaded from bitstream data in memory. 

4. The method of claim 2 further comprising extracting a merged variable length 
symbol from said merged data block, said merged variable length symbol comprised of 
said unprocessed data bits and a plurality of data bits from said second data block. 

5. The method of claim 4 wherein said merged variable length symbol is sent to a 
register. 

6. The method of claim 4 further comprising: 

evaluating said merged data block for variable length symbols; 

extracting any whole variable length symbols located in said second data block; 

and 

sending an extracted whole variable length symbols to a register. 

7. The method of claim 6 wherein each of said variable length symbols is comprised 
of at least N-bits of data. 
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8. The method of claim 7 wherein N is a number greater than one. 

9. The method of claim 2 wherein said first data block is held in a first single 
instruction multiple data register. 

10. The method of claim 3 wherein said first single instruction multiple data register 
is a buffer. 

11. The method of claim 2 wherein said second data block is held in a second single 
instruction multiple data register. 

12. The method of claim 1 wherein said shift merge operation is a single instruction 
multiple data type of instruction to cause a parallel shift right merge of data operands 
based on a shift count. 

13. The method of claim 10 wherein said shift merge operation operates on data 
elements at a byte granularity. 

14. The method of clam 10 wherein said shift merge operation operates on data 
elements at a bit granularity. 

15. An apparatus comprising: 

an execution unit to execute a plurality of instructions for a variable length 
decoding algorithm, wherein one of said instructions is a first instruction for a shift 
merge operation, said plurality of instructions to cause said execution unit to: 

determine whether any unprocessed data bits for a partial variable length 
symbol exist in a first data block; and 

perform a shift merge operation to merge said unprocessed data bits from 
said first data block with a second data block, wherein a merged data block is 
formed. 
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16. The apparatus of claim 15 wherein plurality of instructions further cause said 
execution unit to extract a merged variable length symbol from said merged data block, 
said merged variable length symbol comprised of said unprocessed data bits and a 
plurality of data bits from said second data block. 

17. The apparatus of claim 15 wherein said first data block and said second data block 
are received from a bitstream. 

18. The apparatus of claim 16 wherein said plurality of instructions further cause said 
execution unit to: 

evaluate said merged data block for variable length symbols; and 

extract any whole variable length symbols located in said second data block. 

19. The apparatus of claim 18 wherein each of said variable length symbols is 
comprised of at least two bits of data. 

20. The apparatus of claim 19 wherein said first data block and said second data block 
are held in a first single instruction multiple data register and a second single instruction 
multiple data register, respectively. 

21 . The apparatus of claim 20 wherein said first single instruction multiple data 
register is a buffer. 

22. The apparatus of claim 21 wherein said shift merge operation is a single 
instruction multiple data type of instruction to cause a parallel shift right merge of data 
operands based on a shift count. 

23. The apparatus of claim 22 wherein said shift merge operation is to operate on data 
elements at a byte granularity. 

24. An article comprising a machine readable medium that stores a program, said 
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program being executable by a machine to perform a method comprising: 

determining whether any unprocessed data bits for a partial variable length symbol 

exist in a first data block; and 

performing a shift merge operation to merge said unprocessed data bits from said 

first data block with a second data block, wherein a merged data block is formed. 

25. The article of claim 24 wherein said first data block and said second data block 
are to be loaded from a data bitstream. 

26. The article of claim 25 wherein said method further comprises extracting a 
merged variable length symbol from said merged data block, said merged variable length 
symbol comprised of said unprocessed data bits and a plurality of data bits from said 
second data block. 

27. The article of claim 26 wherein each of said variable length symbols is comprised 
of at least two bits of data. 

28. The article of claim 24 wherein said shift merge operation is a single instruction 
multiple data type of instruction to cause a parallel shift right merge of data operands 
based on a shift count. 

29. The article of claim 26 wherein said method further comprises: 

. evaluating said merged data block for variable length symbols; and 
extracting any whole variable length symbols located in said second data block. 

30. A system comprising: 

a memory to store data and instructions; 

a processor coupled to said memory on a bus, said processor operable to perform 
instructions for a variable length decoding algorithm, said processor comprising: 
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a bus unit to receive a sequence of instructions from said memory; 
an execution unit coupled to said bus unit, said execution unit to execute 
said sequence, said sequence to include a first instruction for a shift merge 
operation, said sequence to cause said execution unit to: 

determine whether any unprocessed data bits for a partial variable 
length symbol exist in a first data block; and 

perform a shift merge operation to merge said unprocessed data 
bits from said first data block with a second data block, wherein a 
merged data block is formed. 

3 1 . The system of claim 30 wherein said first data block and said second data block 
are loaded from bitstream data. 

32. The system of claim 30 wherein said plurality of instructions further cause said 
execution unit to extract a merged variable length symbol from said merged data block, 
said merged variable length symbol comprised of said unprocessed data bits and a 
plurality of data bits from said second data block. 

33. The system of claim 32 wherein said plurality of instructions further cause said 
execution unit to: 

evaluate said merged data block for variable length symbols; and 

extract any whole variable length symbols located in said second data block. 

34. The system of claim 33 wherein each of said variable length symbols is comprised 
of at least two bits of data. 

35. The system of claim 34 wherein said first data block and said second data block 
are held in a first single instruction multiple data register and a second single instruction 
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multiple data register, respectively. 

36. The system of claim 35 wherein said shift merge operation is a single instruction 
multiple data type of instruction to cause a parallel shift right merge of data operands 
based on a shift count. 

37. The system of claim 36 wherein at least a single edge for at least one or more of 
said variable length symbols is not aligned at a byte boundary. 

38. The system of claim 37 wherein said shift merge operation is to operate on data 
elements at a bit granularity. 

39. The system of claim 37 wherein said shift merge operation is to operate on data 
elements at a byte granularity. 
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